BlogPulse: Automated Trend Discovery for Weblogs

نویسندگان

  • Natalie S. Glance
  • Matthew Hurst
  • Takashi Tomokiyo
چکیده

Over the past few years, weblogs have emerged as a new communication and publication medium on the Internet. In this paper, we describe the application of data mining, information extraction and NLP algorithms for discovering trends across our subset of approximately 100,000 weblogs. We publish daily lists of key persons, key phrases, and key paragraphs to a public web site, BlogPulse.com. In addition, we maintain a searchable index of weblog entries. On top of the search index, we have implemented trend search, which graphs the normalized trend line over time for a search query and provides a way to estimate the relative buzz of word of mouth for given topics over time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Survey on Perception of People Regarding Utilization of Computer Science & Information Technology in Manipulation of Big Data, Disease Detection & Drug Discovery

this research explores the manipulation of biomedical big data and diseases detection using automated computing mechanisms. As efficient and cost effective way to discover disease and drug is important for a society so computer aided automated system is a must. This paper aims to understand the importance of computer aided automated system among the people. The analysis result from collected da...

متن کامل

Indexing Weblogs One Post at a Time

In order to perform analysis over weblogs, we must first identify the appropriate unit of a weblog that corresponds to a document. We argue in the paper that, for weblogs, the correct unit is the weblog post. A weblog post is a structured document with the following fields: date, timestamp, title, content, permalink and author. We present our approach for segmenting weblogs into posts, which br...

متن کامل

Envisioning With Weblogs

In this position paper we present a vision of how the stories that people tell in Internet weblogs can be used directly for automated commonsense reasoning, specifically to support the core envisionment functions of event prediction, explanation, and imagination.

متن کامل

بررسی محتوای یادداشت‌های ارسالی و نظرات وبلاگ‌های فردی و گروهی کتابداری و اطلاع‎رسانی فارسی

The present study employed a content analysis method for analyzing the posts and comments in 85 individual and 31 collective weblogs published in Farsi on the subject of Library and information science. Studies showed that the average monthly postings in collective weblog are more than individual weblogs, while regarding the comments posted the reverse is true. The highest numbers of postings i...

متن کامل

Weblogs: Technology for Instruction and Learning

As weblogs, in its nascent state, are becoming one of the most participated online activities after web surfing, email, and instant messaging, it has been considered more of a trend in net broadcasting than just a fad. The emergence of bloggers and their behavioral models has opened up new research opportunities in many perspectives. This article attempts are twofold; 1) demonstrate how weblogs...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003